Generating videos of media items associated with a user

ABSTRACT

A method includes grouping media items associated with a user into segments based on a timestamp associated with each media item and a total number of media items. The method also includes selecting target media from the media items for each of the segments based on media attributes associated with the media item. The method also includes generating a video that includes the target media for each of the segments by generating a first animation that illustrates a first transition from a first item from the target media to a second item from the target media with movement of the first item from an onscreen location to an offscreen location, wherein the first item is adjacent to the second item in the first animation and determining whether the target media includes one or more additional items. The method also includes adding a song to the video.

CROSS-REFERENCE TO RELATED APPLICATIONS

The present application is a continuation of U.S. patent applicationSer. No. 15/633,011, filed Jun. 26, 2017 and titled GENERATING VIDEOS OFMEDIA ITEMS ASSOCIATED WITH A USER, which is a continuation of U.S.patent application Ser. No. 14/885,285, filed Oct. 16, 2015 and titledGENERATING VIDEOS OF MEDIA ITEMS ASSOCIATED WITH A USER (now U.S. Pat.No. 9,691,431), the contents of which are incorporated herein byreference in its entirety.

BACKGROUND

People enjoy looking at media items that they have previously posted onwebsites, such as social networking websites or video websites. Manualcuration of media items into a video or compilation may not be scalablewith large numbers of media. It may also be difficult to automaticallyselect items from the media to use for a compilation. For example, someevents may be overrepresented or underrepresented. In addition,depending on the number of media items associated with the user, it maybe computationally expensive to process all the media items according todifferent media criteria.

SUMMARY

Implementations generally relate to media items and videos. In someimplementations, a method to generate a video includes grouping mediaitems associated with a user into segments based on a timestampassociated with each media item and a total number of media items. Themethod may further include selecting target media from the media itemsfor each of the segments based on media attributes associated with themedia item, wherein the target media includes two or more media items.The method may further include generating the video that includes thetarget media for each of the segments by: generating a first animationthat illustrates a first transition from a first item from the targetmedia to a second item from the target media with movement of the firstitem from an onscreen location to an offscreen location, wherein thefirst item is illustrated as being attached to the second item in thefirst animation, determining whether the target media includes one ormore additional items, and responsive to determining that the targetmedia includes one or more additional items, generating a secondanimation that illustrates a second transition from the second item to afirst additional item and that further illustrates additionaltransitions between a remainder of the one or more additional items. Themethod may further include adding a song to the video, wherein one ormore features of the song are synchronized with the first transition andthe second transition.

In some implementations, the method may further include the timestampbeing associated with each media item represents at least one of anupload time associated with uploading each media item, a capture timeassociated with capturing each media item, a creation time associatedwith creating each media item, and a last modification made to eachmedia item. The method may further include selecting the media itemsfrom media associated with the user based on the timestamp beingindicative of a particular time period. Generating the first animationmay further include illustrating a zoom-out effect on the first item,wherein the zoom-out effect is synchronized to the one or more featuresof the song. The method may further include grouping the media itemassociated with the user into the plurality of segments by groupingmedia items associated with an event into two or more segments based onan event number of media items associated with the event as compared tothe total number of media items. The method may further includeselecting the target media from the media item for each of the pluralityof segments based on the media attributes associated with the media itemincludes selecting based on a share tendency. The method may furtherinclude a number of target media being a function of the total number ofmedia items as determined prior to the grouping of the media items.

In some implementations, a system to generate a video includes one ormore processors coupled to a memory, a media selector module, and avideo generation module. The media selector module may be stored in thememory and executable by the one or more processors to select targetmedia from media items by, for each of the media items: randomlyselecting a first rule from a plurality of rules, determining whetherthe media item satisfies the first rule, responsive to a determinationthat the media item does not satisfy the first rule, continuing toselect a subsequent rule from the plurality of rules until thesubsequent rule is satisfied or the plurality of rules are exhausted,and determining that the media item is a target media item if the mediaitem satisfied the first rule or the subsequent rule. The videogeneration module may be stored in the memory, coupled to the mediaselector module, and executable by the one or more processors togenerate the video that includes the target media.

In some implementations, the video generation module is configured togenerate the video by: generating two or more animations, wherein eachanimation illustrates a transition between two media items of the targetmedia items and arranging the two or more animations into a sequence.The system may further include a music module stored on the memory andexecutable by the one or more processors, the music module configured toadd a song to the video and synchronize one or more features of the songwith the transitions in the two or more animations. The media selectormodule may be configured to select the first rule from the plurality ofrules that include two or more of an important entity rule, a landmarkrule, a face frequency rule, a face affinity rule, and a highlightsrule. The video generation module may be configured to generate a coverimage that is target media that appears first in the video and whereinthe video generation module identifies the cover image by determiningwhether the cover image satisfies at least the first rule and thesubsequent rule.

In some implementations, the system may further include a user interfacemodule stored on the memory and executable by the one or moreprocessors, the user interface module configured to provide the userwith a user interface that includes options for performing one or moreof adding a user-specified media item to the target media, removing oneor more media items from the target media, specifying an order of thetarget media in the video, specifying a title of the video, andselecting the song. The user interface module may also be configured togenerate a preview of the video from at least one of the first item inthe video and subsequent items in the video that are static images. Theuser interface module may also be configured to generate a link to anetwork location that, when selected by the user, provides the user withaccess to the video.

In some implementations, an apparatus to generate a video includes meansfor grouping media items associated with a user into segments based on atimestamp associated with each media item and a total number of mediaitems, means for selecting target media from the media items for each ofthe segments based on media attributes associated with the media item,wherein the target media includes two or more media items, means forgenerating the video that includes the target media for each of thesegments by: generating a first animation that illustrates a firsttransition from a first item from the target media to a second item fromthe target media with movement of the first item from an onscreenlocation to an offscreen location, wherein the first item is illustratedas being attached to the second item in the first animation, means fordetermining whether the target media includes one or more additionalitems, responsive to determining that the target media includes one ormore additional items, means for generating a second animation thatillustrates a second transition from the second item to a firstadditional item and that further illustrates additional transitionsbetween a remainder of the one or more additional items, and means foradding a song to the video, wherein one or more features of the song aresynchronized with the first transition and the second transition.

In some implementations, the apparatus may further include the timestampbeing associated with each media item represents at least one of anupload time associated with uploading each media item, a capture timeassociated with capturing each media item, a creation time associatedwith creating each media item, and a last modification made to eachmedia item. The apparatus may further include means for selecting themedia items from media associated with the user based on the timestampbeing indicative of a particular time period. Generating the firstanimation may further include means for illustrating a zoom-out effecton the first item, wherein the zoom-out effect is synchronized to theone or more features of the song. The apparatus may further includemeans for grouping the media item associated with the user into theplurality of segments by grouping media items associated with an eventinto two or more segments based on an event number of media itemsassociated with the event as compared to the total number of mediaitems. The apparatus may further include means for selecting the targetmedia from the media item for each of the plurality of segments based onthe media attributes associated with the media item includes selectingbased on a share tendency. The apparatus may further include a number oftarget media being a function of the total number of media items asdetermined prior to the grouping of the media items.

Other aspects may include corresponding methods, systems, apparatus, andcomputer program products.

The system and methods described below advantageously generate videos ofmedia items that a user is expected to enjoy viewing. In addition, byrandomly selecting a rule to apply to media items instead of applyingmultiple rules at once, the system and methods increase the chances ofcreating an interesting video. By synchronizing the video with featuresof a song, such as beats of a song, the video may be more captivating toaudiences. Lastly, the system and methods advantageously providecoverage of content spanning a long time period and reduce thecomputational cost of generating the video by selecting target media foreach segment independently.

BRIEF DESCRIPTION OF THE DRAWINGS

The disclosure is illustrated by way of example, and not by way oflimitation in the figures of the accompanying drawings in which likereference numerals are used to refer to similar elements.

FIG. 1 illustrates a block diagram of an example system that generates avideo.

FIG. 2 illustrates a block diagram of an example computing device thatgenerates a video.

FIG. 3A illustrates a graphic representation of a first portion of ananimation.

FIG. 3B illustrates a graphic representation of a second portion of theanimation.

FIG. 3C illustrates a graphic representation of a third portion of theanimation.

FIG. 3D illustrates a graphic representation of a fourth portion of theanimation.

FIG. 4 is a flowchart of an example method to generate a video.

FIG. 5 is a flowchart of an example method to select target media frommedia items.

FIG. 6 is a flowchart of an example method to generate an animation withtwo or more transitions.

DETAILED DESCRIPTION

Some implementations may include a system and method to generate avideo. Media items associated with a user may be grouped into segmentsbased on a timestamp associated with each media item and a total numberof media items. For example, if there are 72 photos available that areevenly distributed over a particular time period, the media items may begrouped into 12 segments where each segment includes six of the mediaitems. Target media may be selected from the media items for each of thesegments based on media attributes associated with the media item. Forexample, for each segment, two media items may be selected from the sixmedia items available for that particular segment.

In some implementations, the target media may be selected based onapplying a rule to each media item that is randomly selected from aplurality of rules. For example, the first rule selected may determinewhether the media item includes an important entity, such as a birthdaycake. If the media item does not satisfy the rule, a subsequent rule maybe selected from the plurality of rules. For example, the subsequentrule may determine whether the media item includes several facesincluding the user's face. If there is no subsequent rule available, theprocess may proceed to a next media item. Alternatively, the targetmedia may be selected based on a tendency of users to share the images.

A video may be generated that includes the target media for each of thesegments. Generating the video may include generating a first animationthat illustrates a first transition from a first item from the targetmedia to a second item from the target media with movement of the firstitem from an onscreen location to an offscreen location, where the firstitem is adjacent to the second item in the first animation. In someimplementations, the video may include additional transitions withadditional items in additional animations.

In some implementations, a song may be added to the video where one ormore features of the song are synchronized with the video. For example,beats of the song may be synchronized with the transitions in the video.

Example System

FIG. 1 illustrates a block diagram of an example system 100 thatgenerates videos. The illustrated system 100 includes a server 101, userdevices 115 a, 115 n and a network 105. Users 125 a-125 n may beassociated with respective user devices 115 a, 115 n. In someimplementations, the system 100 may include other servers or devices notshown in FIG. 1. For example, the system 100 may include a separatesocial network server, a music server, etc. In FIG. 1 and the remainingfigures, a letter after a reference number, e.g., “115 a,” represents areference to the element having that particular reference number. Areference number in the text without a following letter, e.g., “115,”represents a general reference to implementations of the element bearingthat reference number.

In the illustrated implementation, the entities of the system 100 arecommunicatively coupled via a network 105. The network 105 may be aconventional type, wired or wireless, and may have numerous differentconfigurations including a star configuration, token ring configurationor other configurations. Furthermore, the network 105 may include alocal area network (LAN), a wide area network (WAN) (e.g., theInternet), and/or other interconnected data paths across which multipledevices may communicate. In some implementations, the network 105 may bea peer-to-peer network. The network 105 may also be coupled to orinclude portions of a telecommunications network for sending data in avariety of different communication protocols. In some implementations,the network 105 includes Bluetooth® communication networks or a cellularcommunications network for sending and receiving data including viashort messaging service (SMS), multimedia messaging service (MMS),hypertext transfer protocol (HTTP), direct data connection, wirelessapplication protocol (WAP), email, etc. Although FIG. 1 illustrates onenetwork 105 coupled to the user devices 115 and the server 101, inpractice one or more networks 105 may be coupled to these entities.

The server 101 may include a processor, a memory and networkcommunication capabilities. In some implementations, the server 101 is ahardware server 101. The server 101 is communicatively coupled to thenetwork 105 via signal line 102. Signal line 102 may be a wiredconnection, such as Ethernet, coaxial cable, fiber-optic cable, etc., ora wireless connection, such as Wi-Fi, Bluetooth, or other wirelesstechnology.

In some implementations, the server 101 sends and receives data to andfrom one or more of the user devices 115 a-115 n via the network 105.The server 101 may include a video application 103 a and a database 199.

The video application 103 may be code and routines configured togenerate videos. In some implementations, the video application 103 maybe implemented using hardware including a field-programmable gate array(FPGA) or an application-specific integrated circuit (ASIC). In someimplementations, the video application 103 may be implemented using acombination of hardware and software.

The database 199 may store videos generated by a video application 103,such as the video application 103 a stored on the server 101 or thevideo application 103 b stored on the user device 125 a. The database199 may also store social network data associated with users 125, songs,etc.

The user device 115 may be a computing device that includes a memory anda hardware processor, for example a laptop computer, a desktop computer,a tablet computer, a mobile telephone, a wearable device, a head-mounteddisplay, a mobile email device, a portable game player, a portable musicplayer, a reader device, a television with one or more processorsembedded therein or coupled thereto, or other electronic device capableof accessing a network 105.

In the illustrated implementation, user device 115 a is coupled to thenetwork 105 via signal line 108 and user device 115 n is coupled to thenetwork 105 via signal line 110. Signal lines 108 and 110 may be a wiredconnection, such as Ethernet, coaxial cable, fiber-optic cable, etc., ora wireless connection, such as Wi-Fi, Bluetooth, or other wirelesstechnology. User devices 115 a, 115 n are accessed by users 125 a, 125n, respectively. The user devices 115 a, 115 n in FIG. 1 are used by wayof example. While FIG. 1 illustrates two user devices, 115 a and 115 n,the disclosure applies to a system architecture having one or more userdevices 115.

In some implementations, the user device 115 can be a mobile device thatis included in a wearable device worn by the user 125. For example, theuser device 115 is included as part of a clip (e.g., a wristband), partof jewelry, or part of a pair of glasses. In another example, the userdevice 115 can be a smart watch. The user 125 can view videos from thevideo application 103 on a display of the device worn by the user 125.For example, the user 125 can view the videos on a display of a smartwatch or a smart wristband.

In some implementations, the video application 103 b may be stored on auser device 115 a. The video application 103 may include a thin-clientvideo application 103 b stored on the user device 115 a and a videoapplication 103 a that is stored on the server 101. For example, thevideo application 103 a may be a web application that generates webpages viewable by the user device 115 a using the video application 103b. In various implementations, the video application 103 may include amobile application that runs on the user device 115 a and sendsinformation to the video application 103 a stored on the server 101. Forexample, the user 125 a may capture media using the user device 115 aand transmit the media to the server 101 for the video application 103a. The video application 103 a stored on the server 101 may process theinformation and send additional information back to the videoapplication 103 b stored on the user device 115 a. For example, thevideo application 103 a may generate a video from the media and transmitthe video to the user device 115 a for display. The user device 115 amay stream the video directly from the server 101 or store the video onthe user device 115 a.

In some implementations, the video application 103 may be a standaloneapplication stored on the server 101. A user 125 a may access the webpages using a browser or other software on the user device 125 a. Inthis implementation, the video application 103 b stored on the userdevice 115 a may receive instructions from the video application 103 astored on the server 101 to display information generated by the videoapplication 103 a. In some implementations, the video application 103may include the same components on the user device 115 a as are includedon the server 101. In these implementations, a video may be generatedfrom media items by the server 101 or by the user device 115. In theseimplementations, a video may be generated from media items by the server101 or by the user device 115.

Example Computing Device

FIG. 2 illustrates a block diagram of an example computing device 200that generates videos. The computing device 200 may be a server 101 or auser device 115. The computing device 200 may include a processor 235, amemory 237, a communication unit 239, a display 241, and a storagedevice 243. A video application 103 may be stored in the memory 237. Thecomponents of the computing device 200 may be communicatively coupled bya bus 220.

The processor 235 includes an arithmetic logic unit, a microprocessor, ageneral purpose controller or some other processor array to performcomputations and provide instructions to a display device. Processor 235processes data and may include various computing architectures includinga complex instruction set computer (CISC) architecture, a reducedinstruction set computer (RISC) architecture, or an architectureimplementing a combination of instruction sets. Although FIG. 2 includesa single processor 235, multiple processors 235 may be included. Otherprocessors, operating systems, sensors, displays and physicalconfigurations may be part of the computing device 200. The processor235 is coupled to the bus 220 for communication with the othercomponents via signal line 222.

The memory 237 stores instructions that may be executed by the processor235 and/or data. The instructions may include code for performing thetechniques described herein. The memory 237 may be a dynamic randomaccess memory (DRAM) device, a static RAM, or some other memory device.In some implementations, the memory 237 also includes a non-volatilememory, such as a (SRAM) device or flash memory, or similar permanentstorage device and media including a hard disk drive, a floppy diskdrive, a compact disc read only memory (CD-ROM) device, a DVD-ROMdevice, a DVD-RAM device, a DVD-RW device, a flash memory device, orsome other mass storage device for storing information on a morepermanent basis. The memory 237 includes code and routines configured toexecute the video application 103, which is described in greater detailbelow. The memory 237 is coupled to the bus 220 for communication withthe other components via signal line 224.

The communication unit 239 transmits and receives data to and from atleast one of the user device 115 and the server 101 depending upon wherethe video application 103 may be stored. In some implementations, thecommunication unit 239 includes a port for direct physical connection tothe network 105 or to another communication channel. For example, thecommunication unit 239 includes a universal serial bus (USB), securedigital (SD), category 5 cable (CAT-5) or similar port for wiredcommunication with the user device 115 or the server 101, depending onwhere the video application 103 may be stored. In some implementations,the communication unit 239 includes a wireless transceiver forexchanging data with the user device 115, server 101, or othercommunication channels using one or more wireless communication methods,including IEEE 802.11, IEEE 802.16, Bluetooth® or another suitablewireless communication method. The communication unit 239 is coupled tothe bus 220 for communication with the other components via signal line226.

In some implementations, the communication unit 239 includes a cellularcommunications transceiver for sending and receiving data over acellular communications network including via short messaging service(SMS), multimedia messaging service (MMS), hypertext transfer protocol(HTTP), direct data connection, WAP, e-mail or another suitable type ofelectronic communication. In some implementations, the communicationunit 239 includes a wired port and a wireless transceiver. Thecommunication unit 239 also provides other conventional connections tothe network 105 for distribution of files and/or media objects usingstandard network protocols including, but not limited to, UDP, TCP/IP,HTTP, HTTPS, SMTP, SPDY, QUIC, etc.

The display 241 may include hardware configured to display graphicaldata received from the video application 103. For example, the display241 may render graphics to display a user interface that includes avideo. The display 241 is coupled to the bus 220 for communication withthe other components via signal line 228. Other hardware components thatprovide information to a user may be included as part of the computingdevice 200. For example, the computing device 200 may include a speakerfor audio interfaces, a vibration or force feedback device, or othertypes of non-display output devices. In some implementations, such aswhere the computing device 200 is a server 101, the display 241 may beoptional. In some implementations, the computing device 200 may notinclude all the components. In implementations where the computingdevice 200 is a wearable device, the computing device 200 may notinclude storage device 243. In some implementations, the computingdevice 200 may include other components not listed here, e.g., one ormore cameras, sensors, battery, etc.

The storage device 243 may be a non-transitory computer-readable storagemedium that stores data that provides the functionality describedherein. In implementations where the computing device 200 is the server101, the storage device 243 may include the database 199 in FIG. 1. Thestorage device 243 may be a DRAM device, a SRAM device, flash memory orsome other memory device. In some implementations, the storage device243 also includes a non-volatile memory or similar permanent storagedevice and media including a hard disk drive, a floppy disk drive, aCD-ROM device, a DVD-ROM device, a DVD-RAM device, a DVD-RW device, aflash memory device, or some other mass storage device for storinginformation on a permanent basis. The storage device 243 is coupled tothe bus 220 for communication with the other components via signal line230.

In the illustrated implementation shown in FIG. 2, the video application103 includes a preselector module 202, a segmentation module 204, amedia selector module 206, a video generation module 208, a music module210, a social network module 212, and a user interface module 214. Othermodules and/or configurations are possible.

The preselector module 202 may be configured to receive and transmitdata. In some implementations, the preselector module 202 may be a setof instructions executable by the processor 235 to filter media items.In some implementations, the preselector module 202 may be stored in thememory 237 of the computing device 200 and can be accessible andexecutable by the processor 235.

In some implementations, the preselector module 202 may receive mediaitems. The preselector module 202 may receive the media items from auser, such as when the user captures media from a user device 115, fromother users that shared the media items with the user, for example, viaa social network operated by the social network module 212, tagged theuser in the media items, etc. Exemplary media items may include, but arenot limited to, an image, a video, an animated Graphics InterchangeFormat (GIF) file, a photo burst series of images, a face switchingseries of images, and a multi-zoom series of images. An exemplary photoburst may include an animation of a fast switch between images thatcreate a stop-motion effect. An exemplary face switching may be used toshow changes in a person's face over a particular period of time. Forexample, the face switching may use different pictures of a baby to showhow the baby has changed over the past year.

The preselector module 202 may filter the media items to removeduplicates and remove media items that fail to meet preselectioncriteria. The preselection criteria may include, for example, rules thatrequire people or landmarks to be in the images (as opposed to images ofdocuments, bills, receipts, etc.), rules against including screenshots,rules that require a threshold level of image quality, etc.

In some implementations, the preselector module 202 may determinewhether media items are suitable for being used as target media based oncropping the media items. For example, the preselector module 202 maycrop portions of images to identify the most salient regions of theimage at a particular aspect ratio. The preselector module 202 may cropthe images by cutting any combination of the top, bottom, left, or rightof the image to highlight the entities or people in the image. Ininstances where portrait images include people, the preselector module202 may draw bounding boxes around people in the images and determinewhether the portrait images may be cropped to be converted intolandscape images while keeping all the people in the image. In someimplementations, the preselector module 202 may discard portrait imageswhere the crop cuts out some of the people in the image.

The segmentation module 204 may be configured to group media itemsassociated with a user into segments. In some implementations, thesegmentation module 204 may be a set of instructions executable by theprocessor 235 to segment media items. In some implementations, thesegmentation module 204 may be stored in the memory 237 of the computingdevice 200 and can be accessible and executable by the processor 235.

In some implementations, the segmentation module 204 receives filteredmedia items from the preselector module 202. In some implementationswhere the preselector module 202 is not included in the videoapplication 103, the segmentation module 204 receives the media items.For example, the segmentation module 204 may receive the media items asdescribed above with reference to the preselector module 202.

The segmentation module 204 may group the media items associated with auser into segments based on a timestamp associated with each media itemand a total number of media items. The segmentation module 204 may usethe timestamp for each media item and the total number of media items toensure that there is sufficient time diversity of media items. Forexample, if the timestamps are evenly dispersed throughout a certaintime period, such as a year, the segmentation module 204 may divide themedia items into 12 segments where each event is associated with aparticular segment. Alternatively, if timestamps for the media itemsindicate that there are an abundance of media items centered around oneevent, the segmentation module 204 may divide the media items associatedwith one event into multiple segments. The segmentation module 204 maydetermine whether to group media items associated with an event into twoor more segments based on an event number of media items associated withthe event as compared to the total number of media items. For example,the segmentation module 204 may determine that the media items include a“Trip to Paris” event and because 200 of the 300 media items areassociated with the trip to Paris, the segmentation module 204 may groupsome media items associated with the trip to Paris into a first segmentthat includes the first two days in Paris based on the timestamps andgroup the other media items associated with the event into a secondsegment that includes the last two days in Paris based on thetimestamps.

Exemplary timestamps may be associated with, but are not limited to, anupload time associated with uploading each media item, a capture timeassociated with capturing each media item, a creation time associatedwith creating each media item, and a last modification made to eachmedia item. The segmentation module 204 may select media items frommedia associated with a user based on the timestamp associated with eachmedia item being indicative of a particular time period. Exemplary timeperiods may include, but are not limited to, a calendar year, a birthdayyear, a time period of an important event (e.g., a soccer tournament), atime period of a chapter of a person's life (a wedding and theassociated celebration), etc.

In some implementations, the segmentation module 204 may determine anumber of target media based on a total number of media items. Thesegmentation module 204 may determine the number of target media beforegrouping the media items into segments. For example, the segmentationmodule 204 may determine that the number of target media is half thetotal number of media items.

The media selector module 206 may be configured to select target mediafrom media items. In some implementations, the media selector module 206may be a set of instructions executable by the processor 235 to selecttarget media. In some implementations, the media selector module 206 maybe stored in the memory 237 of the computing device 200 and can beaccessible and executable by the processor 235.

In some implementations, the media selector module 206 selects targetmedia for each segment independently, which may advantageously ensuretime-diversity and reduce the time cost of selecting target media ascompared to selecting target media for each segment sequentially. Insome implementations, the media selector module 206 selects one to threeimages for each segment.

The media selector module 206 may select the target media from the mediaitems based on media attributes associated with the media item. In someimplementations, the media selector module 206 may use a training modelto determine how to score the media items based on a tendency of usersto share the images. For example, the media selector module 206 maycompare media that were shared with other users to media that were notshared to identify media attribute differences. Based on thedifferences, the media selector module 206 may create a model for how toassociate a share tendency score to media items.

In some implementations, the media selector module 206 may applydifferent rules to determine which media items should be selected astarget media based on the media attributes. The rules may includedifferent types of image processing steps. For example, in instanceswhere the user consents to the use of such data, the media selectormodule 206 may perform image recognition to identify people in the mediaitems. In some implementations, the media selector module on the rule.The media selector module 206 may determine whether the score exceeds athreshold, select the target media based on top scores, etc.

The rules may include an important entity rule that determines whetherone or more important entities are present in the media item (e.g.,based on experiences particular to the user, such as an image of awooden box that the user indicates is important or experiencesparticular to users in general, such as an image of a birthday cake or awedding picture). The rules may further include a landmark rule thatdetermines whether one or more important landmarks are present in themedia item (e.g., based on a popularity score associated with theentities, where the Eiffel Tower may be determined to be an importantlandmark). The rules may further include an important person rule thatdetermines whether one or more important people are in the media item(e.g., based on a face frequency score). The rules may further includean important event rule that determines whether the media itemcorresponds to an important event based on whether a predeterminednumber of photos were taken during a particular time period (e.g., atleast five photos were taken in an hour, based on correlating the mediawith an important date associated with the user, such as a birthday,anniversary, festival, trip, holiday, etc.). The rules may furtherinclude a high-visual quality rule that determines whether the mediaitem has a visual quality that is within or exceeds a quality threshold(e.g., based on a level of blurriness, dynamic range, etc.). The rulesmay further include a highlight rule that determines whether the mediaitem represents a highlight of an event. The rules may further include asocially engaged rule that determines whether one or more people in amedia item are socially engaged (e.g., whether multiple people arelooking at each other or looking directly at the camera versus peoplelooking at unidentified objects, people that have vacant stares, etc.).In some implementations, in the instance where the user consents to theuse of such data, the media selector module 206 performs facialrecognition to identify the people in the media item and determineposture, gaze direction, and emotion of the people (e.g., based on afacial pose that indicates smiling, surprised, crying, etc.). In someimplementations, the time-diversity rule that determines that targetmedia for a segment include time diversity (e.g., the target media arenot selected from a single event), a shared media rule that determineswhether the media is shared from a media album, a burst for mix rulethat determines a quality of a burst for different types of media, aburst for animation rule that determines a quality of a burst foranimation, a burst for face switching that determines a quality of aburst that includes different faces as part of a burst, a face affinityrule that determines an affinity between the user associated with themedia item and a person in the media item (e.g., in the instance wherethe user and the person consent to the use of such data, the affinitymay be based on communications between the user and the person, such asa number of emails, texts, chats, etc. shared between the user and theperson), and a face clustering rule that determines clustering of peoplein a media album (e.g., in the instance where the user and the personconsent to the use of such data, the media selector module 206 mayperform facial recognition to identify the people in the media).

In some implementations, the media selector module 206 randomly selectsone of the rules from the plurality of rules according to a configuredprobability. For example, the media selector module 206 may use aprocess that employs a random or pseudorandom number generationtechnique to determine a rule to select from a group of rules. The mediaselector module 206 may determine whether the media item satisfies therule. If the media item satisfies the rule, the media item may beselected as target media. If the media item does not satisfy the rule,the media selector module 206 may randomly select a subsequent rule. Themedia selector module 206 may continue this process until a thresholdnumber of rules are selected, there are no longer subsequent rulesavailable, or the media item satisfies one of the subsequent rules. Therandom selection process may be advantageous over a process wheremultiple rules are applied all at once because some of the rules maycontradict each other. For example, the important event rule may resultin selecting multiple media items from the same event. Conversely, thetime-diversity rule may prohibit selecting multiple media items from thesame event.

In some implementations, the media selector module 206 may rank therules based on importance and select rules to apply to a media itembased on the importance of the rules. For example, the media selectormodule 206 may determine, from decreasing importance, that the order ofthe rules is the important entity rule, the important person rule, theimportant event rule, the social engagement rule, the time diversityrule, and the high-visual quality rule. In some implementations, themedia selector module 206 may assign a score to each rule, compute atotal score for each media item that is a weighted sum of the scores,and select the media items that are associated with the best scores(e.g., the highest scores if a higher score is indicative of a mediaitem that satisfies one or more rules). One disadvantage of the rankingapproach is that the media selector module 206 selects media items astarget media that somewhat satisfy many rules over media items thatgreatly satisfy one rule, which may result in mediocre results for thetarget media. In addition, it may be difficult to select appropriateweights for scoring the different types of rules.

In some implementations, the media selector module 206 selects a coverimage and an ending image for the video. For example, the media selectormodule 206 may identify media items that satisfy multiple rulesmentioned above. In some implementations, the media selector module 206may apply different criteria when selecting the cover image and theending image. In some implementations, the criteria when selecting thecover image and the ending image may be stricter than for the rest ofthe target media. For example, the cover image and the ending image mayhave to satisfy a first rule and subsequent rules, such as having animportant entity or landmark, an important person or a group photo wherethe user associated with the media items is in the image, people smilingin the image, and the visual quality may include a different qualitythreshold value. In some implementations, the quality threshold valuewhen selecting the cover image and the ending image may be stricter thanfor the rest of the target media. In some implementations, the mediaselector module 206 selects landscape images or images that may becropped to form a landscape image for the cover image and the endingimage.

The video generation module 208 may be configured to generate a videofrom the target media. In some implementations, the video generationmodule 208 may be a set of instructions executable by the processor 235to generate the video. In some implementations, the video generationmodule 208 may be stored in the memory 237 of the computing device 200and can be accessible and executable by the processor 235.

In some implementations, the video generation module 208 generatesportions of a video from target media for each segment and combines theportions in sequence to form the video. In some implementations, thevideo generation module generates a video that is a sequence of thetarget media.

The video generation module 208 may assemble the target media into avideo by creating animations of the target media that are arranged intoa sequence. In particular, instead of having a slideshow that displaysone photo and then fades into another photo, the video generation module208 may generate one or more animations that include transitions betweendifferent target media items. For example, the video generation module208 may generate an animation that includes a close-up of a first image,the animation then zooms out of the close-up of the first image to showadditional portions of the first image. The animation then zooms out ofthe first image a second time and then the animation includes atransition from the first image to a second image, where the secondimage is adjacent to the first image in the animation. The animationsare discussed in greater detail below with reference to FIGS. 3A-3D.Other transitions are possible, such as changing a perspective of thetarget media by zooming-in to emphasize important regions of the targetmedia or an illustration of the target media moving in a particulardirection (left-to-right, top-to-bottom, etc.).

In some implementations, the video generation module 208 generates avideo that includes a first animation that illustrates transitions froma first item from the target media to a second item from the targetmedia with movement of the first item from an onscreen location to anoffscreen location, where the first item is adjacent to the second item.The video generation module 208 may determine whether the target mediaincludes one or more additional items and, if there are additionalitems, the video generation module 208 may generate a subsequentanimation that illustrates a subsequent transition from the second itemto an additional item. The subsequent transition may further illustrateadditional transitions between a remainder of the one or more additionalitems.

The video generation module 208 may add a title to the video, such as atitle that describes the particular time period represented by thetarget media. For example, the video may be titled “Year in Review2014,” “Sara Is One,” “Daniel's Graduation,” etc. The video generationmodule 208 may determine a title for the video from a title of an album,in the instance where the user consents to the use of the data,determining a title based on calendar entries, etc.

The music module 210 may be configured to add a song to the video. Insome implementations, the music module 210 may be a set of instructionsexecutable by the processor 235 to add the song to the video. In someimplementations, the music module 210 may be stored in the memory 237 ofthe computing device 200 and can be accessible and executable by theprocessor 235.

In some implementations, the music module 210 may add a song to thevideo, where one or more features of the song are synchronized with thevideo. In some implementations, the features of the song may includebeats that are matched to the transitions. For example, the transitionfrom a first media item to a second media item may be synchronized witha beat of the video. In another example, a zoom-out effect on a firstitem in the target media may be synchronized with a beat of the video.

In some implementations, different types of beats may be matched withthe video. For example, transitions between target media items may bematched with a major beat in the song and transitions between differentperspectives of a target media item (e.g., zooming-out) may be matchedto a minor beat in the song. The features may also include differentvocal sections, instrumental sections, quiet sections, loud sections,male voice sections, female voice sections, etc.

In some implementations, the music module 210 may use a database ofsongs (e.g., the database 199 in FIG. 1, the storage device 243 of FIG.2, etc.) or retrieve songs from a third-party server that are identifiedto have features that synchronize with transitions in a video. Forexample, the music module 210 may identify songs for the database thathave consistent beats at predetermined intervals, songs that areinstrumental, etc. In some implementations, the music module 210 mayselect different songs based on a theme of the video. For example, themusic module 210 may determine that the theme of a video is “Mara's Tripto Paris” based a title associated with the video, landmark recognitionof the target media, etc. and match the Parisian theme with aFrench-inspired song. In some implementations, the music module 210 maysynchronize any song selected by a user to the video.

The social network module 212 may be configured to manage a socialnetwork. In some implementations, the social network module 212 may be aset of instructions executable by the processor 235 to manage the socialnetwork. In some implementations, the social network module 212 may bestored in the memory 237 of the computing device 200 and may beaccessible and executable by the processor 235. In some implementations,the social network module 212 may be excluded from the video application103.

A social network can be a type of social structure where the users maybe connected by a common feature. The common feature may includerelationships/connections, e.g., friendship, family, work, an interest,etc. The common features may be provided by one or more socialnetworking systems including explicitly defined relationships andrelationships implied by social connections with other online users,where the relationships form a social graph that may be stored in thestorage device 243. In some examples, the social graph can reflect amapping of these users and how they can be related. As a result, thesocial network may encompass, for example, social network applications,mail applications, chat applications, and search applications.

In the instance where the user consents to the use of such data, thesocial network module 212 may post the video to the user's social feed.In addition, where users have consented to the use of such data, thesocial network module 212 may identify other people in the video thatare members of the social network. For example, the social networkmodule 212 may receive the identity of the members from the mediaselector module 206. The social network module 212 may associate theidentity of the members with the members' profiles. The social networkmodule 212 may then tag the members in the video and notify the membersthat they were tagged in the video.

The user interface module 214 may be configured to provide informationto a user. In some implementations, the user interface module 214 can bea set of instructions executable by the processor 235 to provide thefunctionality described below for providing information to a user. Insome implementations, the user interface module 214 can be stored in thememory 237 of the computing device 200 and can be accessible andexecutable by the processor 235.

In some implementations, the user interface module 214 may generate auser interface that allows a user to edit the video. For example, theuser interface may include options that allow a user to edit the video.In some examples, the user may edit the video by adding a user-specifiedmedia item to the target media and removing one or more media items fromthe target media. In some examples, the user may edit the video byspecifying an order of the target media in the video, editing the targetmedia (cropping a photo, applying an enhancement to a burst, etc.), andreordering the target media in the video. In some examples, the user mayedit the video by changing the song, selecting the song, and modifyingthe volume of the song. In some examples, the user may edit the video bychanging the title. In instances where the user edits the video, themedia selector module 206 may select replacement target media (ifapplicable), the video generation module 208 may update the video, andthe music module 210 may update synchronization of the song with theupdated video.

The user interface module 214 may generate a link to a network locationfor a user or other users (e.g., users of the social network) to accessthe video. For example, the user interface module 214 may generate auniform resource locator (URL) that the user may share with other users,a landing page that any user may visit to obtain their own personalizedvideo, etc. The user interface module 214 may further generate one ormore options to share the network location, such as an emailnotification, a mobile callout, a share by link, an automaticnotification to other users that are directly connected to the user inthe social network, a post to be displayed on the user's social network,etc. In some implementations, the user interface module 214 may generatean option to download the video.

In some implementations, the user interface module 214 may generate apreview of the video that includes highlights of the video. For example,the preview may include the cover image, the end image, and 12 targetmedia items that are static images and that satisfy the rules. In someimplementations, the target media for the preview may satisfy a morestrict set of criteria than the target media for the video. For example,the target media for the preview may be selected from the best scoredtarget media from the video. In some implementations, the preview may beautomatically added to a user's media library (on the cloud, on theirphone, etc.).

Example Graphic Representations of Animations

FIGS. 3A-3D illustrates graphic representations of an animationgenerated by the video generation module 208. FIG. 3A illustrates agraphic representation 300 of a first portion of an animation. In thisexample, the graphic representation 300 is a close-up of a user 305 anda friend 307 of the user. Because the graphic representation 300 is acover image, the graphic representation 300 satisfies multiple rules asdetermined by the media selector module 206. For example, the mediaselector module 206 determines that the cover image includes the user,the cover image includes a person with a high face affinity score basedon frequent communications between the person and the user (with consentfrom the user), the cover image includes an important landmark 309, andthe quality of the image meets a threshold quality value.

FIG. 3B illustrates a graphic representation 325 of a second portion ofthe animation. In this example, the animation includes a firsttransition from the close-up of the user 305 and the friend 307 asillustrated in FIG. 3A to a zoomed-out version of the cover image asillustrated in FIG. 3B. In this graphic representation 325 the importantlandmark 309 is identifiable as the Eiffel Tower.

FIG. 3C illustrates a graphic representation 350 of a third portion ofthe animation. In this example, the animation includes a secondtransition from the close-up of the user 305, the friend 307, and theimportant landmark 309 as illustrated in FIG. 3B to a zoomed-out versionof the cover image as illustrated in FIG. 3C. The graphic representation350 also includes a title for the cover image that indicates that thesubject matter of the video is the year in photos.

FIG. 3D illustrates a graphic representation 375 of a fourth portion ofthe animation. In this example, the animation includes a thirdtransition from the cover image 380 to a second item 385. The thirdtransition is illustrated as the cover image 380 being adjacent to thesecond item 385 with movement from the right to the left as the coverimage 380 moves from an onscreen location to an offscreen location. Thesecond item is illustrated as including another friend 387 of the userin front of another important landmark 389, which the media selectormodule 206 identified as the Zakim Bridge in Boston.

In some implementations, the first two transitions may be synchronizedto a different feature of the song than the third transition. This maybe advantageous, for example, for the user to distinguish betweendifferent types of transitions, such a zoom-out transition (e.g., thefirst two transitions) and a horizontal movement (e.g., the thirdtransition).

Example Methods

FIG. 4 is a flowchart of an example method 400 to generate a video. Themethod 400 may be implemented by server 101, a user device 115 or acombination of the server 101 and the user device 115, using the videoapplication 103 illustrated in FIG. 1. The video application 103 mayinclude the segmentation module 204, the media selector module 206, thevideo generation module 208, and the music module 210 illustrated inFIG. 2.

At block 402, media items associated with a user are grouped intosegments based on a timestamp associated with each media item and atotal number of media items. For example, the segmentation module 204may group the media items associated with the user into segments basedon the timestamp data and the total number of media items. In someimplementations, if the timestamp data and the total number of mediaitems indicate a time diversity in the media items, the segmentationmodule 204 may group the media items into segments where a single eventrepresented in the media items are part of one of the segments. Forexample, images of a user's birthday party may be associated with asingle segment. In other implementations, if there is a lack of timediversity in the media items, a single event represented in the mediaitems may span two or more segments. For example, a user's trip to Parismay be associated with two segments.

At block 404, target media is selected from the media items for each ofthe segments based on media attributes associated with the mediaattributes associated with the media item, the target media includingtwo or more media items. For example, the media selector module 206 mayselect the target media. The target media may be selected based on rulesas described with reference to FIG. 5.

At block 406, a video is generated that includes the target media foreach of the segments. For example, the video generation module 208 maygenerate the video. The video includes animations as described ingreater detail below with reference to FIG. 6.

At block 408, a song is added to the video, where one or more featuresof the song are synchronized with the video. For example, the musicmodule 210 may add the song to the video. In some examples, the one ormore features may include beats that are synchronized to transitions inthe video.

While blocks 402 to 408 are illustrated in a particular order, otherorders are possible with intervening steps. In some implementations,some blocks may be added, skipped, or combined.

FIG. 5 is a flowchart of an example method 500 to select target mediafrom media items. The method 500 may be implemented by server 101, auser device 115 or a combination of the server 101 and the user device115, using the video application 103 illustrated in FIG. 1. The videoapplication 103 may include the media selector module 206 illustrated inFIG. 2.

At block 502, a rule may be randomly selected from a plurality of rules.For example, the media selector module 206 may randomly select the rulefrom the plurality of rules. The rule may be randomly selected accordingto a probability associated with the rule (e.g., a weight associatedwith each of the rules).

At block 504, it is determined whether the media item satisfies therule. For example, the media selector module 206 determines whether themedia item satisfies the rule. The rule may be satisfied by applying ascore to the media item based on media attributes being analyzed basedon the rule and determining whether the score exceeds a threshold value.Responsive to a determination that the media item does not satisfy therule, the method 500 proceeds to block 508 where it is determinedwhether there is a subsequent rule from the plurality of rules. Forexample, the media selector module 206 determines where there is asubsequent rule. Responsive to a determination that the subsequent ruleis present, the method 500 proceeds to block 502. Responsive to adetermination that there is not a subsequent rule from the plurality ofrules, the method 500 proceeds to block 510.

Responsive to the media item satisfying the rule, the method 500proceeds to block 510. At block 510, it is determined whether there areadditional media items. For example, the media selector module 206determines whether there are additional media items. Responsive todetermining that there are additional media items, the method 500proceeds to block 506. Responsive to determining that there are notadditional media items, the method 500 proceeds to block 406 of FIG. 4.

While blocks 502 to 510 are illustrated in a particular order, otherorders are possible with intervening steps. In addition, blocks may beadded, skipped, or combined.

FIG. 6 is a flowchart of an example method 600 to generate an animationwith two or more transitions. The method 600 may be implemented usingthe video application 103 illustrated in FIG. 1. The video application103 may include the video generation module 208 illustrated in FIG. 2.

At block 602, a first animation is generated that illustrates a firsttransition from a first item from the target media to a second item fromthe target media with movement of the first item from an onscreenlocation to an offscreen location, where the first item is adjacent tothe second item in the first animation. For example, the videogeneration module 208 generates the first animation.

At block 604, it is determined whether the target media includes one ormore additional items. For example, the video generation module 208determines whether the target media includes the one or more additionalitems. Responsive to a determination that the target media does notinclude the one or more additional items, the method 600 proceeds toblock 408 of FIG. 4. Responsive to the target media including the one ormore additional items, the method 600 proceeds to block 606.

At block 606, a subsequent animation is generated that illustrates asubsequent transition from the second item to an additional item. Forexample, the video generation module 208 generates the subsequentanimation. The method proceeds to block 604 and continues in a loopuntil there are no longer additional items in the target media.

In the above description, for purposes of explanation, numerous specificdetails are set forth in order to provide a thorough understanding ofthe specification. It will be apparent, however, to one skilled in theart that the disclosure can be practiced without these specific details.In some instances, structures and devices are shown in block diagramform in order to avoid obscuring the description. For example, theimplementations can be described above primarily with reference to userinterfaces and particular hardware. However, the implementations canapply to any type of computing device that can receive data andcommands, and any peripheral devices providing services.

Reference in the specification to “some implementations” or “someinstances” means that a particular feature, structure, or characteristicdescribed in connection with the implementations or instances can beincluded in at least one implementation of the description. Theappearances of the phrase “in some implementations” in various places inthe specification are not necessarily all referring to the sameimplementations.

Some portions of the detailed descriptions above are presented in termsof algorithms and symbolic representations of operations on data bitswithin a computer memory. These algorithmic descriptions andrepresentations are the means used by those skilled in the dataprocessing arts to most effectively convey the substance of their workto others skilled in the art. An algorithm is here, and generally,conceived to be a self-consistent sequence of steps leading to a desiredresult. The steps are those requiring physical manipulations of physicalquantities. Usually, though not necessarily, these quantities take theform of electrical or magnetic data capable of being stored,transferred, combined, compared, and otherwise manipulated. It hasproven convenient at times, principally for reasons of common usage, torefer to these data as bits, values, elements, symbols, characters,terms, numbers, or the like.

It should be borne in mind, however, that all of these and similar termsare to be associated with the appropriate physical quantities and aremerely convenient labels applied to these quantities. Unlessspecifically stated otherwise as apparent from the following discussion,it is appreciated that throughout the description, discussions utilizingterms including “processing” or “computing” or “calculating” or“determining” or “displaying” or the like, refer to the action andprocesses of a computer system, or similar electronic computing device,that manipulates and transforms data represented as physical(electronic) quantities within the computer system's registers andmemories into other data similarly represented as physical quantitieswithin the computer system memories or registers or other suchinformation storage, transmission, or display devices.

The implementations of the specification can also relate to a processorfor performing one or more steps of the methods described above. Theprocessor may be a special-purpose processor selectively activated orreconfigured by a computer program stored in the computer. Such acomputer program may be stored in a non-transitory computer-readablestorage medium, including, but not limited to, any type of diskincluding floppy disks, optical disks, ROMs, CD-ROMs, magnetic disks,RAMs, EPROMs, EEPROMs, magnetic or optical cards, flash memoriesincluding USB keys with non-volatile memory, or any type of mediasuitable for storing electronic instructions, each coupled to a computersystem bus.

The specification can take the form of some entirely hardwareimplementations, some entirely software implementations or someimplementations containing both hardware and software elements. In someimplementations, the specification is implemented in software, whichincludes, but is not limited to, firmware, resident software, microcode,etc.

Furthermore, the description can take the form of a computer programproduct accessible from a computer-usable or computer-readable mediumproviding program code for use by or in connection with a computer orany instruction execution system. For the purposes of this description,a computer-usable or computer-readable medium can be any apparatus thatcan contain, store, communicate, propagate, or transport the program foruse by or in connection with the instruction execution system,apparatus, or device.

A data processing system suitable for storing or executing program codewill include at least one processor coupled directly or indirectly tomemory elements through a system bus. The memory elements can includelocal memory employed during actual execution of the program code, bulkstorage, and cache memories which provide temporary storage of at leastsome program code in order to reduce the number of times code must beretrieved from bulk storage during execution.

In situations in which the systems discussed above collect personalinformation, the systems provide users with an opportunity to controlwhether programs or features collect user information (e.g., informationabout a user's social network, social actions or activities, profession,a user's preferences, or a user's current location), or control whetherand/or how to receive content from the server that may be more relevantto the user. In addition, certain data may be treated in one or moreways before it is stored or used, so that personally identifiableinformation is removed. For example, a user's identity may be treated sothat no personally identifiable information can be determined for theuser, or a user's geographic location may be generalized where locationinformation is obtained (such as to a city, ZIP code, or state level),so that a particular location of a user cannot be determined. Thus, theuser may have control over how information is collected about the userand used by the server.

What is claimed is:
 1. A computer-implemented method to generate avideo, the method comprising: grouping media items associated with auser into segments; selecting target media from the media items for eachof the segments based on media attributes associated with the mediaitems, wherein the target media includes two or more of the media items;generating the video that includes the target media for each of thesegments by: illustrating a first transition from a first media itemfrom the target media to a second media item from the target media withmovement of the first media item from an onscreen location to anoffscreen location; determining whether the target media includes one ormore additional media items; and responsive to determining that thetarget media includes one or more additional media items, illustrating asecond transition from the second media item to a first additional itemand that further illustrates additional transitions between a remainderof the one or more additional media items; and adding a song to thevideo, wherein one or more features of the song are synchronized withthe video.
 2. The method of claim 1, wherein: grouping the media itemsassociated with the user into segments is based on a respectivetimestamp associated with the media items; and the respective timestampassociated with each media item represents at least one of an uploadtime associated with uploading the media item, a capture time associatedwith capture of the media item, a creation time associated with themedia item, or a last modification time associated with the media item.3. The method of claim 1, wherein grouping the media items associatedwith the user into segments is based on the media items including a faceof the user and at least one additional face.
 4. The method of claim 1,wherein illustrating the first transition further comprises:illustrating a zoom-out effect on the first media item, wherein thezoom-out effect is synchronized to the one or more features of the song.5. The method of claim 1, wherein grouping the media items associatedwith the user into the segments includes grouping media items associatedwith an event into two or more segments based on an event number ofmedia items associated with the event as compared to a total number ofmedia items.
 6. The method of claim 1, wherein selecting the targetmedia from the media items for each of the segments based on the mediaattributes associated with the media items includes selecting based on ashare tendency associated with each of the media items.
 7. The method ofclaim 1, wherein grouping the media items is based on one or more of themedia items being associated with a landmark score that meets athreshold value, wherein the landmark score that meets the thresholdvalue indicates that an important landmark is depicted in the one ormore of the media items.
 8. A system to generate a video, the systemcomprising: one or more processors coupled to a memory; a media selectormodule stored in the memory and executable by the one or moreprocessors, the media selector module executable by the one or moreprocessors to: group media items associated with a user into segments;select target media from the media items for each of the segments basedon media attributes associated with the media items, wherein the targetmedia includes two or more of the media items; a video generation modulestored in the memory and coupled to the media selector module, the videogeneration module executable by the one or more processors to: generatethe video that includes the target media for each of the segments by:illustrating a first transition from a first media item from the targetmedia to a second media item from the target media with movement of thefirst media item from an onscreen location to an offscreen location;determining whether the target media includes one or more additionalmedia items; and responsive to determining that the target mediaincludes one or more additional media items, illustrating a secondtransition from the second media item to a first additional item andthat further illustrates additional transitions between a remainder ofthe one or more additional media items; and a music module stored on thememory and executable by the one or more processors to add a song to thevideo, wherein one or more features of the song are synchronized withthe video.
 9. The system of claim 8, wherein: grouping the media itemsassociated with the user into segments is based on a respectivetimestamp associated with the media items; and the respective timestampassociated with each media item represents at least one of an uploadtime associated with uploading the media item, a capture time associatedwith capture of the media item, a creation time associated with themedia item, or a last modification time associated with the media item.10. The system of claim 8, wherein grouping the media items associatedwith the user into segments is based on the media items including a faceof the user and at least one additional face.
 11. The system of claim 8,wherein illustrating the first transition further comprises:illustrating a zoom-out effect on the first media item, wherein thezoom-out effect is synchronized to the one or more features of the song.12. The system of claim 8, wherein grouping the media items associatedwith the user into the segments includes grouping media items associatedwith an event into two or more segments based on an event number ofmedia items associated with the event as compared to a total number ofmedia items.
 13. The system of claim 8, wherein selecting the targetmedia from the media items for each of the segments based on the mediaattributes associated with the media items includes selecting based on ashare tendency associated with each of the media items.
 14. The systemof claim 8, wherein grouping the media items is based on one or more ofthe media items being associated with a landmark score that meets athreshold value, wherein the landmark score that meets the thresholdvalue indicates that an important landmark is depicted in the one ormore of the media items.
 15. A non-transitory computer storage mediumencoded with a computer program, the computer program comprisinginstructions that, when executed by one or more computers, cause the oneor more computers generate a video by performing operations comprising:grouping media items associated with a user into segments; selectingtarget media from the media items for each of the segments based onmedia attributes associated with the media items, wherein the targetmedia includes two or more of the media items; generating the video thatincludes the target media for each of the segments by: illustrating afirst transition from a first media item from the target media to asecond media item from the target media with movement of the first mediaitem from an onscreen location to an offscreen location; determiningwhether the target media includes one or more additional media items;and responsive to determining that the target media includes one or moreadditional media items, illustrating a second transition from the secondmedia item to a first additional item and that further illustratesadditional transitions between a remainder of the one or more additionalmedia items; and adding a song to the video, wherein one or morefeatures of the song are synchronized with the video.
 16. The computerstorage medium of claim 15, wherein: grouping the media items associatedwith the user into segments is based on a respective timestampassociated with the media items; and the respective timestamp associatedwith each media item represents at least one of an upload timeassociated with uploading the media item, a capture time associated withcapture of the media item, a creation time associated with the mediaitem, or a last modification time associated with the media item. 17.The computer storage medium of claim 15, wherein grouping the mediaitems associated with the user into segments is based on the media itemsincluding a face of the user and at least one additional face.
 18. Thecomputer storage medium of claim 15, wherein illustrating the firsttransition further comprises: illustrating a zoom-out effect on thefirst media item, wherein the zoom-out effect is synchronized to the oneor more features of the song.
 19. The computer storage medium of claim15, wherein grouping the media items associated with the user into thesegments includes grouping media items associated with an event into twoor more segments based on an event number of media items associated withthe event as compared to a total number of media items.
 20. The computerstorage medium of claim 15, wherein selecting the target media from themedia items for each of the segments based on the media attributesassociated with the media items includes selecting based on a sharetendency associated with each of the media items.